Phoneme and Sentence-Level Ensembles for Speech Recognition
نویسندگان
چکیده
منابع مشابه
Phoneme and Sentence-Level Ensembles for Speech Recognition
We address the question of whether and how boosting and bagging can be used for speech recognition. In order to do this, we compare two different boosting schemes, one at the phoneme level, and one at the utterance level, with a phoneme level bagging scheme. We control for many parameters and other choices, such as the state inference scheme used. In an unbiased experiment, we clearly show that...
متن کاملThe Gamma MLP for Speech Phoneme Recognition
We define a Gamma multi-layer perceptron (MLP) as an MLP with the usual synaptic weights replaced by gamma filters (as proposed by de Vries and Principe (de Vries and Principe, 1992)) and associated gain terms throughout all layers. We derive gradient descent update equations and apply the model to the recognition of speech phonemes. We find that both the inclusion of gamma filters in all layer...
متن کاملClustering beyond phoneme contexts for speech recognition
The clustering of using decision trees is generalized to take into account high-level knowledge sources to better model the co-articulation e ects in large vocabulary continuous speech recognition. VQ models are used to reduce the computational cost in constructing decision trees. The search algorithm is designed such that it can provide a general type of information for decision trees without ...
متن کاملString-level MCE for continuous phoneme recognition
In this paper, we present results for the Minimum Classi cation Error (MCE) [1] framework for discriminative training applied to tasks in continuous phoneme recognition. The results obtained using MCE are compared with results for Maximum Likelihood Estimation (MLE). We examine the ability of MCE to attain high recognition performance with a small number of parameters. Phoneme-level and string-...
متن کاملEffects of presentation level on phoneme and sentence recognition in quiet by cochlear implant listeners.
OBJECTIVE The objectives of this study were to characterize the effects of presentation level on speech recognition in quiet by cochlear implant users with the Nucleus 22 SPEAK and Clarion v1.2 CIS speech-processing strategies, and to relate speech recognition at low presentation levels to stimulus audibility as measured by sound field thresholds. It was hypothesized that speech recognition per...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: EURASIP Journal on Audio, Speech, and Music Processing
سال: 2011
ISSN: 1687-4714,1687-4722
DOI: 10.1155/2011/426792